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The SARS coronavirus main peptidase (SARS-CoV M pro ) plays an essential 
role in the life-cycle of the virus and is a primary target for the development 
of anti-SARS agents. Here, we report the crystal structure of M pro at a 
resolution of 1.82 A, in space group P2 1 at pH 6.0. In contrast to the 
previously reported structure of M pro in the same space group at the same 
pH, the active sites and the SI specificity pockets of both protomers in the 
structure of M pro reported here are in the catalytically competent 
conformation, suggesting their conformational flexibility. We report two 
crystal structures of M pro having an additional Ala at the N terminus of each 
protomer (M£a(-i))/ both at a resolution of 2.00 A, in space group P4 3 2 1 2: one 
unbound and one bound by a substrate-like aza-peptide epoxide (APE). In 
the unbound form, the active sites and the SI specificity pockets of both 
protomers of M£a(-i) are observed in a collapsed (catalytically incompetent) 
conformation; whereas they are in an open (catalytically competent) 
conformation in the APE-bound form. The observed conformational 
flexibility of the active sites and the SI specificity pockets suggests that 
these parts of M pro exist in dynamic equilibrium. The structural data further 
suggest that the binding of APE to M pro follows an induced-fit model. The 
substrate likely also binds in an induced-fit manner in a process that may 
help drive the catalytic cycle. 

© 2006 Elsevier Ltd. All rights reserved. 

Keywords: X-ray crystallography; SARS coronavirus main peptidase; aza- 
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Introduction 

Severe acute respiratory syndrome (SARS) is a 
highly transmissible, infectious and often fatal 
disease (World Health Organization-Severe acute 
respiratory syndrome!). Since its outbreak in 2002 
and rapid spread throughout early 2003, efforts in 
the development of anti-SARS vaccines and drugs 
have taken on paramount importance. 

SARS is caused by a coronavirus (CoV); 1-3 it is an 
enveloped, positive-sense single-stranded RNA 


t http: / /www.who.int/csr/sars/en/ 

Abbreviations used: SARS, severe acute respiratory 
syndrome; CoV, coronavirus; APE, aza-peptide epoxide; 
CMK, chloromethyl ketone. 
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virus. Anti-SARS therapeutics could target any one 
of several major steps in the viral life-cycle, such as 
virus-cell interactions, virus entry, intracellular viral 
replication, virus assembly and exit. 4 Extensive 
studies have been carried out on the proteins 
involved in these steps. 5 The intracellular replication 
of CoV is mediated by a replicase complex derived 
from two virally coded polyprotein precursors, 
ppla (486 kDa) and pplab (790 kDa). 6 ' 7 The 
formation of this replicase complex requires the 
extensive processing of the two polyproteins by two 
cysteine peptidases encoded within them; namely, 
the main peptidase (M pro ), also known as the 3C-like 
peptidase (3CL pro ) because of its similarity to the 3C 
peptidases of Picornaviridae, and the accessory 
papain-like peptidase 2 (PL2 pro ), 7 which cleaves at 
three sites in the N-proximal regions of the two 
polyproteins. By contrast, M pro cleaves at 11 sites in 
the central and C-proximal regions of the two 
polyproteins, releasing key viral replication pro- 


0022-2836 /$ - see front matter © 2006 Elsevier Ltd. All rights reserved. 
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teins, such as an RNA polymerase and a helicase. 
As an essential protein, M pro is an attractive mole¬ 
cular target for the development of anti-SARS drugs. 

SARS-CoV M pro is catalytically active only in the 
homodimeric form; each protomer has a molecular 
mass of 33.8 kDa. 9 14 Considerable efforts have been 
directed to the X-ray structural studies of M pro , 
resulting in the availability of many crystal struc¬ 
tures of M pro and its variants over a pH range of 
5.9-9.0 in a variety of space groups. All of these 
structures show that the two protomers of M pro are 
oriented almost perpendicular to each other. The N- 
terminal residues 1-7 of each protomer constitute 
the N-finger, of which Arg4 was shown to be 
mandatory for the dimerization and the exhibition 
of the catalytic activity of M pro . 14 Beyond the N- 
finger, each protomer consists of three domains. 
Domain I (residues 8-101) and domain II (residues 
102-184) comprise a two-|3-barrel fold similar to that 
of the chymotrypsin-type serine peptidases. Domain 
III (residues 201-300) has five a-helices and is 
connected to domain II by a long loop (residues 
185-200). Each protomer has its own substrate¬ 
binding region situated in the cleft between domains 
I and II. Recent mutagenesis studies have confirmed 
that, similar to the main peptidases from human 
corona virus strain 229E, 15 ' 16 and porcine trans¬ 
missible gastroenteritis coronavirus, 17 SARS-CoV 
M pro is a cysteine peptidase with a Cys-His catalytic 
dyad at the active site. 16 ' 18 As suggested by the 
structure-based sequence alignment of the main 
peptidases (including their flanking residues in the 
polyproteins) from SARS-CoV and other coro¬ 
na viruses, 16 and confirmed by in vitro studies, 7 ' 9 
M pro cleaves preferentially at a consensus sequence 
for the P4 to PI' residues of substrates (the 
nomenclature is based on that used by Schechter & 
Berger 19 with the arrow indicating the cleavage site): 
(amino acid with a small side-chain)-(any amino 
acid)-Leu-Glnf (Ala, Ser, Gly). 

A number of M pro inhibitors have been proposed 
using structure-based discovery 20 ' 21 and experimen¬ 
tal screening. 22-24 The crystal structures of M pro in¬ 
hibited by some peptidomimetic compounds have 
been determined. Recently, virtual screening 
followed by experimental evaluations have identi¬ 


fied the old drug cinanserin as a strong inhibitor of 
the replication of SARS-CoV; cinanserin likely 
targets M pro . 28 

We have reported on the kinetic and structural 
characterization of the inhibition of SARS-CoV M pro 
by an aza-peptide epoxide (APE, Figure l). 29 APEs 
were synthesized as a new class of inhibitors appa¬ 
rently specific for clan CD cysteine peptidases, 30 ' 31 
including the legumains, 32 and the caspases. Each 
APE has an aza-peptide component, with an epo¬ 
xide moiety attached to the carbonyl group of the 
PI residue. The side-chain of the PI residue pre¬ 
dominantly determines the target-peptidase speci¬ 
ficity of an APE. The substituent on the epoxide C2 
atom also allows some tuning of both the inhibitory 
activity and specificity of APE towards a particular 
target peptidase. In the APE, the C a atom of the PI 
residue is replaced by a nitrogen atom, yielding an 
aza-amino acid residue. This introduces trigonal 
planar geometry to the a-atom of the PI residue 
and reduces the electrophilicity of the carbonyl C 
atom of the PI residue, thereby making the 
carbonyl group of the PI residue resistant to 
nucleophilic attack. 34 It has been proposed that 
APEs inhibit their target peptidases irreversibly by 
a mechanism in which the catalytic Cys S 7 atom 
nucleophilically attacks one of the two epoxide 
carbon atoms (C2 or C3) of APE. 30/32,33 This results 
in the opening of the conformationally strained 
epoxide ring, with the formation of a covalent bond 
between the Cys S 7 atom and the attacked APE 
atom. An APE containing Gin at PI inhibits M pro 
with a fc inact /Ki=1900(±400) M _1 s~\ In this 
reaction, the catalytic Cysl45 S 7 atom attacks the 
epoxide C3 atom of the APE. 29 


Results and Discussion 

Structure determination 

The parameters and statistics derived from data 
processing and structure refinement are summa¬ 
rized in Table 1. SARS-CoV M pro crystallizes in 
space group P2 1 in conditions slightly different from 



Figure 1 . Inhibition of SARS- 
CoV M pro by the aza-peptide epox¬ 
ides (APEs) synthesized for our 
study, Cbz-Leu-Phe-AGln-EP-Coo- 
Et. The epoxide carbon atoms are 
numbered and their stereochemis¬ 
try is omitted for simplicity. The 
proposed mechanism for the irre¬ 
versible inhibition of clan CD 
cysteine peptidases by APEs is 
indicated by arrows. In the inhibi¬ 
tion of M pro , route I was adopted. 
Cbz, the benzyloxycarbonyl group; 
AGln, aza-glutamine; EP, epoxide; 
COOEt, ethyl ester. 
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Table 1 . Parameters and statistics derived from X-ray 
diffraction data processing and structure refinement 



Unbound 

M pro 

Unbound 

MP£ ( . a) :APE 

A. Data processing 
Wavelength (A) 

1.116 

1.116 

1.116 

Resolution 

41.25-1.82 

24.85-2.00 

24.78-2.00 

limit a (A) 

(1.89-1.82) 

(2.07-2.00) 

(2.07-2.00) 

Space group 

P2i 

p 4 3 2i2 

p 4 3 2i2 

Unit-cell 
parameters 
a (A) 

52.39 

70.29 

70.09 

H A) 

96.19 

70.29 

70.09 

C( A) 

67.91 

102.87 

103.86 

a (deg.) 

90.00 

90.00 

90.00 

P (deg.) 

102.91 

90.00 

90.00 

7 (deg.) 

90.00 

90.00 

90.00 

Mosaicity (deg.) 

0.63 

0.65 

0.58 

No. unique 

56,644 (5313) 

17,964 (1739) 

18,098 (1760) 

reflections 

Redundancy 

2.0 (1.8) 

6.8 (6.2) 

7.1 (6.5) 

Completeness (%) 

96.4 (90.6) 

99.3 (97.8) 

99.8 (99.4) 

R S ym b (%) 

3.6 (20.1) 

8.8 (48.8) 

7.4 (44.4) 

<//a(l)> 

20.0 (3.4) 

19.6 (4.2) 

23.7 (4.9) 

B. Structure refinement 
Resolution 

41.25-1.82 

24.85-2.00 

24.78-2.00 

range (A) 

Rwork c (%) 

17.0 

17.5 

19.6 

RfreC (%) 

21.7 

24.1 

27.0 

Number of non-hydrogen atoms per asymmetric unit (average 

B-factor, A ) 

Protein 

4753 (30.12) 

2332 (23.00) 

2332 (23.93) 

APE 

Not 

Not 

46 (22.69) 

Solvent 

applicable 
591 (37.81) 

applicable 
162 (34.38) 

127 (32.49) 

rms deviation from ideal geometry 
Bond lengths (A) 0.019 

0.019 

0.022 

Bond angles (deg.) 

1.778 

1.765 

2.064 

Ramachandran plot 
Favored (%) 

91.3 

90.0 

90.0 

Allowed (%) 

7.3 

8.5 

8.1 

Generously 

0.6 

0.8 

0.8 

allowed (%) 
Disallowed (%) 

0.8 

0.8 

1.2 


a Numbers in parentheses refer to the highest resolution bins. 

Rsym — J' 1 hklJ'ji\lhkl.i ^-hikD | / N hkl N jlhkl.it where Ihkl,i and < Ij t kl > 

are the zth observed intensity and average intensity of the 
reflection hkl, respectively. 

c ^work=Z||fol “ |fc||/Z|fo|, where |F 0 | and |F C | are the 
observed and calculated structure factor amplitudes of a 
particular reflection, respectively. The summation is over 95% of 
the reflections in the specified resolution range. The remaining 5% 
of the reflections were selected randomly before the structure 
refinement and are not included in the structure refinement. R{ ree 
was calculated over these reflections with the equation used for 
7 ? 54 

iV work- 


those reported previously/ Each asymmetric unit 
contains both protomers (A and B) of the physio¬ 
logical dimer. In the electron density maps of the 
unbound M pro , residues 1A-44A, 50A-305A and 
1B-302B could be identified. In the Ramachandran 
plot of this structure, Asp33A, Asn84A and Asn84B 
are in the generously allowed regions, and Asp33B, 
Glu47B, Tyrl54A and Tyrl54B are in the disallowed 
regions. In both independent protomers, the Asp33 
N atom forms a hydrogen bond with the Thr98 O yl 
atom, and the Asp33 carbonyl O atom forms 


hydrogen bonds with the Trp31 N and the Asn95 
N atoms. Also, the Asn84 N (52 atom forms a 
hydrogen bond with the Glul78 carbonyl O atom. 
The poorly defined electron densities of the side- 
chains of Glu47B, Tyrl54A and Tyrl54B indicate 
dynamic disorder. 

SARS-CoV M£a(-i) crystallized in space group 
P2 1 2 1 2 1 z9 as well as P4 3 2!2. Each asymmetric 
unit of the latter contains only one protomer of 
the dimer; the two protomers of each dimer are 
related by the crystallographic 2-fold symmetry 
axis parallel with the C-face diagonal of the unit 
cell. In the electron density maps of the unbound 
M£a(-i)/ residues 3-300 of the protomer were 
clearly identified. In the Ramachandran plot of 
this structure, Asp33 and Seri 39 are in the gen¬ 
erously allowed regions, and Asn84 and Tyrl54 are 
in the disallowed regions. The side-chain of Tyrl54 
makes contact with those of Ile78 and probably 
Arg76 from a neighboring asymmetric unit. The 
electron density of Seri39 is not well defined. In 
the electron density maps of the M^a(-i) : APE 
complex, residues 2-300 of the protomer were 
identified. In the Ramachandran plot of this 
structure, Asp33 and Asn277 are in the generously 
allowed regions, and Asn84, Tyrl54 and Ile286 are 
in the disallowed regions. The side-chains of 
Thr285 and Ile286 from opposite protomers of the 
dimer contact each other. The electron density of 
Asn277 is not well defined. 

Superpositions of the crystal structures reported 
here with all of those previously reported show no 
significant difference in the protomer orientation 
and the overall fold. 

Active sites and substrate-binding regions of the 
unbound SARS-CoV M pro 

The previously reported crystal structure of 
SARS-CoV M pro in space group P2 1 at pH 6.0 
showed the collapse of the active site and SI 
specificity pocket of one of the protomers, whereas 
the structures in the same space group at pH 7.6 
and at pH 8.0 showed all the active sites to be in 
the catalytically competent conformation (Table 2). 
On the basis of this trend, a pH-triggered switch for 
the catalytic activity of M pro was proposed. 25 We 
have now grown crystals of M pro in the same space 
group (P2 1 with the same unit-cell constants) at pH 
6.0 under slightly different conditions; the active 
sites and the SI specificity pockets of both proto¬ 
mers are in the catalytically competent conforma¬ 
tion (Figures 2(a) and (b), 3(a) and (b), 4(a) and (b)). 
More specifically, superposition of protomers A 
and B of the resulting M pro structure (rmsd 0.50 A 
for 1070 out of 1172 main-chain atoms) shows good 
agreement in most atomic positions. In both 
independent protomers, the catalytic dyad has a 
distance of 3.6 A between the His41 N s2 atom and 
the Cysl45 S 7 atom, and the Cysl45 S 7 atom is 
coplanar with the atoms of the His41 imidazole 
ring. Residues Glyl43 to Cysl45 are in the proper 
conformation to form the oxyanion hole that 
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Table 2. Crystal structures of SARS-CoV M pro (wild-type 
and variants) determined in different conditions 


Ref. 

Inhibitor 

Space 

group 

pH 

In the catalytically 
competent conformation? 

Protomer A a Protomer B a 

Wild-type SARS-CoV tvT 0 

b - P2 1 

6.0 

Yes 

Yes 

36 

— 

p 4 3 2 1 2 

5.9 

No 

No 

25 

— 

P2i 

6.0 

Yes 

No 

25 

CMK 

P2 1 

6.0 

Yes 

No 

29 

— 

C2 

6.5 

Yes 

Yes 

29 

APE 

C2 

6.5 

Yes 

Yes 

37 

— 

P2 1 2 1 2 

6.5 

Yes 

Yes 

36 

— 

P2{2{2 

6.6 

No 

No 

42 

— 

P2{2{2 

7.0 

Yes 

Yes 

25 

— 

P2i 

7.6 

Yes 

Yes 

25 

— 

P2 X 

8.0 

Yes 

Yes 

SARS-CoV MEa(-d 

P4 3 2 x 2 

6.5 

No 

No 

b 

APE 

P4 3 2i2 

6.5 

Yes 

Yes 

29 

APE 

P2 1 2 1 2 1 

6.5 

Yes 

Yes 


SARS-CoV M pro with additional Ser-Leu at the N termini of both 
protomers 


c - P2i 6.5 

Yes 

Yes 

SARS-CoV NT 0 C145A variant 

42 product C2 9.0 

Yes 

Yes 


SARS-CoV M pro with additional Gly-Pro-Leu-Gly-Ser at the N termini 


of both 

protomers 





27 

12 

P2 1 

6.0 

Yes 

Yes 

27 

N1 

P2 1 

6.0 

Yes 

Yes 

27 

N3 

P2i 

6.0 

Yes 

Yes 

27 

N9 

P2i 

6.0 

Yes 

Yes 


a For any structure whose asymmetric unit contains only one 
protomer, that protomer represents both protomer A and 
protomer B. 

b These are the structures reported here. 
c PDB accession code 1Q2W; J. B. Bonanno et al., unpublished 
results. 


accommodates the carbonyl O atom of the scissile 
peptide bond of the substrate; the position to be 
occupied by the carbonyl O atom is occupied by a 
water molecule. The O atom of the water molecule 

o 

forms hydrogen bonds at distances of 3.0 A with 
the Glyl43 N atom and with the Cysl45 N atom 
(Figure 3(a) and (b)). 

Previous studies suggest strongly that the pre¬ 
dominant SI specificity of SARS-CoV M pro for Gin 
is determined primarily by the conserved residue 

1 r I / or t V J 

Hisl63. ' ' In both independent protomers, the 
orientation of the imidazole ring of Hisl63 is 
determined by the hydrogen bonding between its 
N 61 atom and the OH group of Tyrl61 (3.0 A) and by 
its 7r-stacking with the phenyl ring of Phel40 (the 
distance between the geometric centers of the 
aromatic rings is 3.9 A). The position to be occupied 
by the side-chain carbonyl O atom of PI-Gin of the 
substrate is occupied by a water molecule, whose O 
atom forms a hydrogen bond with the Hisl63 N 82 
atom (2.9 A); this water molecule is coplanar with 
the atoms of the imidazole ring of Hisl63. In both 
protomers, Phel40 and Glul66 interact with Seri of 
the opposite protomer to form the "floor" of the SI 


specificity pocket (Figure 3(a) and (b)). In protomer 
A, the Glul66A O el atom forms a hydrogen bond 
(2.8 A) with the Hisl72A N 82 atom, thereby 
constituting a "side-wall" of the SI specificity pocket 
similar to that in protomer A of previously reported 
structures and the molecular-dynamic simulation 
model of M pro at pH 6.0 (Figure 3(a)). 25 ' 36 However, 
in protomer B of the structure of M pro reported here, 
the side-chain of Glul66B rotates away and the 
Hisl72B N s2 atom forms a hydrogen bond (3.0 A) 
with the Seri A O 7 atom of protomer A instead. The 
SerlA O 7 atom also forms a hydrogen bond (2.8 A) 
with the Glyl70B carbonyl O atom (Figure 3(b)). The 
difference in the side-chain conformation of Glul66 
in protomers A and B of the structure of M pro 
reported here indicates the weakness of the interac¬ 
tion between the side-chains of Glul66 and Hisl72, 
even though this interaction may acquire some ionic 
character by the protonation of the Hisl72 N 61 atom. 
Contrary to those in protomer B of both the 
previously reported structure and the molecular- 
dynamic simulation model of M pro at pH 6.0, 25 ' 36 the 
side-chains of Hisl63B and Glul66B do not interact 
in protomer B of the structure of M pro reported here, 
probably because Hisl63B is protonated only at its 
N 82 atom and carries no charge. 

As in the previously reported structure of M pro at 
pH 6.0, 25 protomers A and B in the current structure 
show good agreement in the rest of the substrate¬ 
binding region, including the S2, S4 and Sl r 
specificity pockets. Superpositions with the crystal 
structures of M pro in the other conditions show that 
the rest of the substrate-binding regions in the 
structure of M pro reported here are in the catalyti- 
cally competent conformation. 

Active sites and substrate-binding regions of the 
unbound SARS-CoV 

We have reported the crystal structure of the 
SARS-CoV M£a(-i)'APE complex in space group 
P2 1 2 1 2 1 , whose asymmetric unit contains both 
protomers of the M£a(-i) dimer. In that structure, 
the active sites and the substrate-binding regions of 
both protomers are in the catalytically competent 
conformation (Table 2). In both independent proto¬ 
mers, the additional Ala at the N terminus blocks 
Seri and disrupts its interactions with Phel40 and 
Glul66 of the opposite protomer; however, the floor 
of the SI specificity pocket is only partly disrupted. 
More importantly, the presence of a ten-residue 
affinity tag at the N terminus of both independent 
protomers reduces the specific activity of M pro by 
less than an order of magnitude.^ 9 Similar observa¬ 
tions are given by the crystal structure of M pro with 
additional residues Ser-Leu at the N terminus of 
both independent protomers (PDB accession code 
1Q2W; J. B. Bonanno et al., unpublished results). 

Attempts to crystallize M£a(-i) in space group P2 1 
have not been successful. Interestingly, M£a(-i) crys¬ 
tallizes in space group P4 3 2!2 as well as in space 
group P2i2i2i under the same conditions. In space 
group P4 3 2!2, each asymmetric unit has one 
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protomer of M£a(-i)- Superpositions of the resulting 
structure of the unbound M£a(-i) with the crystal 
structures of M pro in the other conditions show no 
difference in the overall fold or the protomer orien¬ 
tation. Crystal contacts of the unbound M£a(-i) in 
space group P4 3 2 1 2 do not involve any residues 
forming the active sites and the SI specificity 
pockets of the peptidase. The structure of the 
unbound M5 a(-i) i n space group P4 3 2 1 2 shows 
good agreement in most atomic positions with the 
structure of the APE-bound M£a(-i) in space group 
P2 1 2 1 2 1 . In the structure of the unbound M£a(-i) re¬ 
ported here, the catalytic dyad has a distance of 
3.9 A between the His41 N s2 atom and the Cysl45 S 7 
atom, and the Cysl45 S 7 atom is coplanar with the 
atoms of the His41 imidazole ring. However, the 
oxyanion hole and the SI specificity pocket are 
distorted (Figures 2(c) and 3(c)). The $ and vff angles 
of residues Lysl37 to Serl44 show dramatic differ¬ 
ences compared with those of the previously 
reported structure of the APE-bound M?a(-i) 
(Figure 4(c)). The above-average B-factors of these 
residues in the unbound M£a(-i) indicate their high 


mobility relative to the rest of the peptidase (Figure 
5(c)). The oxyanion hole is not distorted as much; the 
N atoms of Glyl43 and Cysl45 are still oriented to 
donate hydrogen bonds that would stabilize the 
negatively charged carbonyl O atom of the scissile 
peptide bond of the substrate, although no water 
molecule is found at the position to be occupied by 
the carbonyl O atom (Figure 3(c)). 

The hydrogen bond between the N 61 atom of 
Hisl63 and the OH group of Tyrl61 is preserved 
(3.1 A); however, the imidazole ring of Hisl63 is no 
longer 7r-stacked with the phenyl ring of Phel40; it 
makes contacts with the side-chain of Feul41 
instead (Figure 3(c)). Here the phenyl ring of 
Phel40 makes contacts with the side-chains of 
Valll4, Tyrl26, Ilel36 and Hisl72, and the main- 
chain atoms of Fysl37 and Glyl38 from the parent 
protomer, and with the side-chain of Arg4 from the 
opposite protomer. The Phel40 carbonyl O atom 
also forms a long hydrogen bond with the OH group 
of Tyrll8 (3.4 A). The electron density maps show 
two possible conformers of the side-chain of Glul66; 
the occupancies of both conformers were fixed at 0.5 
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Figure 2 (legend on next page) 
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Figure 2. Electron densities in 
the F 0 -F c omit maps for residues 
Lysl37 to Serl44 of SARS-CoV 
M pro and MGy-i)- ( a ) Protomer A 
of the unbound M pro . (b) Protomer B 
of the unbound M pro . (c) Unbound 
(d) APE-bound M p G ( -i). 
The residues in (c) are in the cata- 
lytically incompetent conformation. 


without refinement. In conformer 1, the Glul66 O 

0-1 

atom forms a hydrogen bond with the Hisl72 N ( 
atom (2.3 A), and the Glul66 O 82 atom forms a 
hydrogen bond with the Hisl63 N s2 atom (2.8 A); 
whereas in conformer 2, the side-chain of Glul66 
protrudes into the solvent (Figure 3(c)). Probably 
because of their high mobility, the additional Ala at 
the N terminus, Seri and Gly2 of the protomer could 
not be identified in the electron density maps of the 
unbound M£a(-i)- Interactions similar to those 
forming the floors of the SI specificity pockets of 
M pro are not observed in the unbound M£a(-i)- 

The rest of the substrate-binding region in the 
structure of the unbound M£a(-i) reported here is in 
the catalytically competent conformation, in good 
agreement with those in the previously reported 
structures of the APE-bound M?a(-i) and of M pro , 
except for some side-chains whose conformational 
rearrangements are necessary in order to accommo¬ 
date APE or the other inhibitors. 

Binding of APE to SARS-CoV 

Both the P2 1 crystals of SARS-CoV M pro and the 
P4 3 2i2 crystals of SARS-CoV MGy-p were soaked 


in solutions of the APE synthesized for this study 
in three stereochemical versions: the two diaster- 
eomers Cbz-Leu-Phe-AGln-(2S,3S)EP-COOEt and 
Cbz-Leu-Phe-AGln-(2R,3R)EP-COOEt, and the ra¬ 
cemic mixture of these two diastereomers. Out¬ 
standing electron density for APE was observed 
only in the electron density maps of M£a(-i> it 
could be fit only by the 2S,3S diastereomer 
(Figure 6(a)). The latter is consistent with the 
results of the studies on the inhibition of M pro by 
the APE in these versions. These results can be 
explained with the models of all four of the 
possible diastereomers of the APE binding to 
M p ro. 29 Superpositions of the structures of the 
unbound and the APE-bound M£a(-i) reported 
here show that the binding of the APE does not 
grossly affect the overall fold or the protomer 
orientation of the M£a(-i) dimer. The same obser¬ 
vations are found in the superpositions of the 
previously reported crystal structures of the un¬ 
bound and the APE-bound M pro in space group 
C2. 29 The structure of the APE-bound MG\(-i) in 
space group P4 3 2!2 also agrees well in most ato¬ 
mic positions with that of the APE-bound M£a(-i) in 
space group P2 1 2 1 2 1 . 
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The structural consequences of the binding of the 
APE to M£a(-i) are very similar to those observed in 
the crystal structures of the unbound and the APE- 
bound M pro in space group C2, and the APE-bound 
M£a(-i) in space group P2 1 2 1 2 1 . 29 A covalent bond 
forms between the Cysl45 S 7 atom of M£a(-i) and 
the epoxide C3 atom of the APE (2.15 A; Figure 6(b) 
and (c)). In the refinement of all of the structures of 
the APE-bound M pro and M£a(-i)/ a restraint was 
applied to this C-S bond on the basis that the length 
of a C-S single bond is normally about 1.8 A. 
Interestingly, the opened epoxide moiety of the APE 
always tended to be tilted away from Cysl45 of 
M pro or M5 a^_i), thereby lengthening this C-S bond 
by 0.2-0.3 A, even though the opened epoxide 
moiety of the APE was manually moved back 
towards Cysl45 of M pro or M£a(-i) regularly during 
the refinement process. Although such lengthening 
is not considered significant as the overall positional 
uncertainties (based on maximum likelihood) of all 
these structures are in the range of 0.2-0.3 A, it does 
suggest the possibility of this C-S bond being under 
strain and vulnerable to rupture caused by a second 
nucleophilic attack at the epoxide C3 atom of the 
APE (say, by an activated water molecule), leading 


to the speculation that the APE could act as a 
reversible inhibitor of M pro . 

The conformation of the opened epoxide moiety 
and the main-chain conformation of Pl-AGln of the 
APE in the structure of the M£a(-i) : APE complex 
reported here are essentially the same as those in the 
previously reported structures of the M pro :APE and 
the M£a(-i) : APE complexes (Figure 7(a)-(d)). Unlike 
the PI C a atom of other inhibitors, the Pl-AGln N a 
atom of the APE is sp -hybridized and has a trigonal 
planar geometry; in order to be accommodated by 
the SI specificity pocket of the peptidases, the side- 
chain of Pl-AGln has to adopt a different conforma¬ 
tion, in particular the equivalent to x (N-N a -C p -C 7 ). 
This angle is -123.6° in the structure of the M£a(-i) : 
APE complex reported here; by contrast, in the 
crystal structures of the complexes of an M pro variant 
with a series of peptidomimetic inhibitors, this angle 
is in the range of -65° to -80°. 27 

Interestingly, the binding of the APE induces the 
recovery of the catalytically competent conforma¬ 
tion of the oxyanion holes and the SI specificity 
pockets of M£a(-i) (Figures 2(d) and 3(d)). The Pl- 
AGln main-chain carbonyl O atom of the APE is 
accommodated in the oxyanion hole of M^a(-i)/ 


(a) 


H41A 



C145A 


w 


G143A 


H163A 


Y161A 




66A 


F140A 


H172A 


(b) 


H41B 



w 


C145 ?^ /VG143B 


H163B 


Y161B A 




F140B 

H172B 


H41A 



C145A 


w 

a G143A 


H163A 


Y161A 




F140A 


H172A 


H41B 



w 


C 14 5B_ AG143B 


H163B 


Y161B A 



S1A 



F140B 

H172B 


S1A 


G170B 


G170B 


Figure 3 (legend on next page) 






Induced-fit Binding of APE to SARS-CoV M pro 


923 


(c) 
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Figure 3. Active sites and SI specificity pockets of SARS-CoV M pro and MSa(-i)/ viewed from the S2 specificity pockets, 
(a) Protomer A of the unbound M pro . (b) Protomer B of the unbound M pro . (c) Unbound MJa(-i)- (d) APE-bound 
M£a(-i) (For clarity, the APE is not shown.). Hydrogen bonds are indicated by broken lines. Water molecules are labeled w. 


forming hydrogen bonds with the N atoms of 
Glyl43 (2.8 A) and Cysl45 (3.2 A) (Figure 6(b) and 
(c)). The cj) and v|i angles of residues Lysl37 to Serl44 
in the structure of the APE-bound M£a(-i) reported 
here are essentially the same as those in the 
previously reported structure of the APE-bound 
M£a(-i) (Figure 4(d)). In contrast to those in the 
structure of the unbound M£a(-i ), the B-factors of 
these residues in the structures of the APE-bound 
MJa(-i) are close to the averages, indicating that the 
mobility of these residues is reduced upon the 
binding of the APE (Figure 5(d)). 

The hydrogen bond between the N 61 atom of 
His 163 and the OH group of Tyrl61 is preserved 
(3.3 A), and the 7v-stacking of the imidazole ring of 
Hisl63 with the phenyl ring of Phel40 is recovered 
(distances between o the geometric centers of the 
aromatic rings: 3.7 A) (Figure 3(d)). The Hisl63 N s2 
atom no longer interacts with the Glul66 O s2 atom, 
but forms a hydrogen bond (2.7 A) with the side- 
chain carbonyl O atom of Pl-AGln of the APE 
instead (Figure 6(b) and (c)). The side-chains of 


Glul66 and Hisl72 interact with each other (3.0 A; 
Figure 3(d)). The additional Ala at the N terminus 
and Seri of the protomer could not be identified in 
the electron density maps of the M£a(-i) : APE 
complex. Interestingly, the side-chain amide N 
atom of Pl-AGln of the APE forms hydrogen 
bonds, though not in ideal geometry, with the 
Phel40 carbonyl O atom (3.4 A) and the Glul66 
O sl and O s2 atom (3.5 A and 2.9 A, respectively) 
(Figure 6(b)). Therefore, without the participation of 
Seri of the opposite protomer, Phel40 and Glul66 
are tied together to form parts of the floors of the SI 
specificity pockets of M£a(-i)- 
Similar to the APE in protomer A of the previously 
reported structure of the APE-bound M£a(-i)/ the 
benzyloxycarbonyl (Cbz) group of the APE in the 
structure of the APE-bound M£a(-i) reported here 
squeezes into and slightly widens the S4 specificity 
pockets of MJa(-i) and, as a result, making contacts 
with Leul67, Prol68, Glnl92 and Alal93 of M p a(-i } - 
Otherwise, the interactions of the APE with the rest 
of the substrate-binding region observed in the 
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(a) 


Unbound M pro (Protomer A) 



(b) 


Unbound M pro (Protomer B) 



(c) 


Unbound M pro 


+A(-1) 



(d) 


APE-bound M pro 


+A(-1) 



Figure 4. Main-chain conformational angles of resi¬ 
dues Thrl35 to Glyl46 in the crystal structures of SARS- 
CoV M pro and M£a(-i) reported here. The $ and the i|; 
angles are represented by the yellow and the blue bars, 
respectively, (a) Protomer A of the unbound M pro (overall 
positional uncertainty estimated based on maximum 
likelihood, a r : 0.15 A), (b) Protomer B of the unbound 
M pro (a r : 0.15 A), (c) Unbound M£a(-i) (a r : 0.23 A), (d) 
APE-bound M$a(-i) (a r : 0.28 A). 


structure of the APE-bound M£a(-i) reported here 
are essentially the same as those observed in the 
previously reported structures of the APE-bound 
MJa(-i) and M pro (Figure 6(b) and (c)). 

Dynamic equilibrium for the conformation of the 
active sites and the SI specificity pockets of 
SARS-CoV M pro 

In contrast to the previously reported crystal 
structure of SARS-CoV M pro in space group P2 1 at 
pH 6.0, 25 the crystal structure of M pro reported here 
(in the same space group and at the same pH) shows 
that the active sites and the SI specificity pockets of 
both protomers are in the catalytically competent 
conformation (Table 2). In the determinations of both 
structures, the complete wild-type sequence (resi¬ 
dues 1-306) of M pro was over-expressed, purified 
and crystallized. Although different strategies might 
have been used in the X-ray diffraction data 
collection and processing, and in the structure 
solution and refinement, this could not result in the 
dramatic structural differences observed. The struc¬ 
tural differences probably arise from the differences 
in the conditions of the preparation and the 
crystallization of M pro . Similarly, the crystal struc¬ 
ture of M pro in space group P2 1 2 1 2 at pH 6.5 shows 
that both protomers are in the catalytically compe¬ 
tent conformation, 37 whereas that in the same 
space group P2 1 2 1 2 and at a slightly higher pH 
(6.6) shows that both protomers of the latter are in 
the catalytically incompetent conformation. 36 The 
conformations of the collapsed active sites and SI 
specificity pockets observed in the catalytically 
incompetent protomers show some variability. This 
variability indicates that the active site and the SI 
specificity pocket of each protomer of M pro do not 
adopt a single conformation in solution at pH 5.9- 
6.6, but instead there is an ensemble of conforma¬ 
tions. A particular conformation might be favored 
by a particular set of crystallization conditions. 

However, the various conformations of the col¬ 
lapsed active sites and SI specificity pockets share a 
common feature: the imidazole ring of Hisl63 is not 
7r-stacked with the phenyl ring of Phel40.7r-stacking 
is an example of aromatic interactions. 38 In the 
structures of the unbound M pro , an aromatic inter¬ 
action is observed between these two rings in an 
offset-stacked (i.e. ^-stacking) or an edge-to-face 
fashion wherever the protomer is in the catalytically 
competent conformation (Figure 8). In both orienta¬ 
tions, one or two hydrogen atoms (with partial 
positive charge) on the phenyl ring of Phel40 are 
positioned near the central region (with partial 
negative charge) of the imidazole ring of Hisl63. 
According to the results of previous studies, 39,40 a 
single aromatic interaction as such is weak (the 
interaction energy may be in the range of 1-2 kcal/ 
mol only), in contrast to the clusters of aromatic 
interactions commonly involved in the stabilization 
of protein structures. 41 Therefore, this interaction is 
susceptible to disruptions that could be caused by 
changes in a number of factors; the formation of this 
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interaction can be viewed as a reversible process in 
dynamic equilibrium: 

Aromatic interaction <-► No aromatic interaction 

(i) 

The position of the equilibrium could be deter¬ 
mined, in part, by pH. At low pH (near or below 
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the pK a i of Hisl63; Figure 9), the Hisl63 N 61 atom 
in a significant number of the M pro protomers is 
protonated, thereby introducing a positive charge 
on the imidazole ring of Hisl63. This would 
disfavor its aromatic interaction with the phenyl 
ring of Phel40, and the position of the equilibrium 
would shift to the right. This is consistent with the 
trend exhibited by the structures of the unbound 
M pro at various pH values (Table 2). In the pH 
range of 7.0-9.0, most of the M pro protomers have 
the aromatic interaction and are in the catalytically 
competent conformation (the left-hand side of the 
equilibrium predominates); whereas in the pH 
range of 5.9-6.6, some of the M pro protomers lose 
the aromatic interaction and are in the catalytically 
incompetent conformation (the position of the 
equilibrium shifts to the right). 

A second factor in determining the position of the 
equilibrium could be the integrity of the interactions 
among Phel40 and Glul66 of the parent protomer, 
and Seri of the opposite protomer. In the structure 
of M£a(-i) reported here, the additional Ala at the N 
terminus of the opposite protomer blocks Seri and 
disrupts its normal interactions with Phel40 and 
Glul66 of the parent protomer. This probably 
weakens the conformational anchor of Phel40, as 
indicated by its above-average B-factors (Figure 
5(c)), making the aromatic interaction of its phenyl 
ring with the imidazole ring of Hisl63 vulnerable to 
disruption. In all of the crystal structures of the 
unbound M pro (both the wild-type and the variants) 
determined so far, wherever Phel40 and Glul66 of 
the parent protomer, and Seri of the opposite 
protomer do not interact normally, the B-factors of 
residues Lysl37 to Serl44 of the parent protomer 
are above the average, even if the parent protomer 
is in the catalytically competent conformation (e.g. 
the M pro structure of M pro at pH 6.5 in the space 
group P2 1 2 1 2); 37 in contrast, wherever the three 
residues interact normally, the B-factors of residues 
Lysl37 to Serl44 of the parent protomer are close to 
the average (e.g. protomer A of the M pro structure 
reported here; Figure 5(a)). These observations 
suggest that the interactions among the three 
residues can immobilize residues 137 to 144. 
Apparently, both the ionic interaction between the 
N terminus of Seri and the side-chain of Glul66 of 
opposite protomers, and the amide hydrogen- 
carbonyl oxygen hydrogen bonds between Seri 
and Phel40 of opposite protomers contribute to the 
immobilization of residues Lysl37 to Seri44. The 


Figure 5. The B-factors of residues Thrl35-Glyl46 in 
the crystal structures of SARS-CoV M pro and M£a(-i) re¬ 
ported here. The main chain and the side-chain B-factors 
are represented by the red and the green bars, respective¬ 
ly. The average values of the main chain and the side- 
chain B-factors are indicated by the red and the green 
lines, respectively. There is no green bar for Glyl38, 
Glyl43 or Glyl46, because these residues do not have 
side-chains, (a) Protomer A of the unbound M pro . (b) 
Protomer B of the unbound M pro . (c) Unbound M5 a(-i)- 
(d) APE-bound M£a(-i)- 
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Figure 6. Interactions of the APE (orange) with SARS-CoV M5 a(-i) (green), (a) Outstanding density in the F 0 - F c 
map of the M2a(-i)-APE complex, (b) View from the 'floors' of the SI specificity pockets of M£a(-i)- (c) A diagram of the 
interactions. Hydrogen bonds are shown as broken lines, with their distances (in A) given alongside. The residues of 
M5 a(-i) i n contact with the APE are shown as arcs. 
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Figure 7. Newman projections 
of the APE. (a) The equivalent to $ 
of Pl-AGln, C(P2-Phe)-N(Pl- 
AGln)-N a (Pl-AGln)-C (Pl-AGln); 
(b) the equivalent to i|i of Pl-AGln, 
N(Pl-AGln)-N a (Pl-AGln)-C(Pl- 
AGln)-C3(epoxide); (c) the torsion 
angle 0=C(Pl-AGln)-C3(epoxide)- 
S 7 (Cysl45); and (d) the torsion 
angle C(Pl-AGln)-C3(epoxide)-C2 
(epoxide)-C(ethyl ester carbonyl). 


M pro structure at pH 7.0 in space group P2i2i2 42 
and protomer B of the M pro structure reported here 
(Figure 5(b)) suggest that the absence of either 
interaction slightly compromises the immobilizing 
effect. 

The increased vulnerability of the aromatic 
interaction to disruption would shift the position 
of the equilibrium to the right. This probably 
explains why the active sites and the SI specificity 
pockets of both protomers are collapsed in the 
crystal structure of M£a(-i) reported here. M pro with 
its three N-terminal residues truncated, M pr °A(i- 3 ), 
exists mainly as a dimer in solution but its activity 
is 24% lower than that of M pro . 14 Without the 
interactions among Phel40 and Glul66 of the 
parent protomer, and Seri of the opposite proto¬ 
mer, the position of the equilibrium would shift to 
the right, resulting in the decreased availability of 
the active sites and the SI specificity pockets in the 
catalytically competent conformation. The binding 
of a substrate to M pr °A(i-3) may induce the recovery 
of the catalytically competent conformation of the 
active sites and the SI specificity pockets (to be 
discussed below), at an energetic cost accounted for 
by the reduced activity of M pr °A(i-3). 

Induced-fit binding of the APE to SARS-CoV 
M pro 


The crystal structures of the unbound and the 
APE-bound SARS-CoV M£a(-i) reported here show 
that the binding of the APE to M£a(-i) follows an 
induced-fit model, not the lock-and-key model. 
More particularly, the binding of the APE induces 
the recovery of the catalytically competent confor¬ 
mation of the oxyanion holes and the SI specificity 
pockets of both protomers of M£a(-i)- This induced 
fit is likely relevant to the binding of APE to M pro , 
because of the conformational flexibility of the 
active sites and the SI specificity pockets of M pro 
discussed above. In all of the crystal structures of 


the inhibitor-bound M pro (both the wild-type and 
the variants) determined so far, ' ' except for 
protomer B in the complex of M pro with a 
chloromethyl ketone (CMK), 2 the active sites and 
the SI specificity pockets are in the catalytically 
competent conformation (Table 2). The induced-fit 
binding of the APE to M£a(-i) probably is driven by 
the formation of the covalent bond between the 
Cysl45 S 7 atom of M£a(-i) and the epoxide C3 
atom of the APE. It would be energetically costly to 
break this C-S bond in order to repel the APE and 
preserve the conformational flexibility of the active 
sites and the SI specificity pockets of M£a(-i)- 
Therefore, the active sites and the SI specificity 
pockets of M£a(-i) have to adopt the catalytically 
competent conformation in order to accommodate 
a substrate mimic like APE. This explanation 
probably applies to the actual catalytic cycle of 
M pro as well: the acylation step (i.e. formation of 


offset-stacked 

H163 



F140 


edge-to-face 


H163 



Figure 8. Aromatic interactions observed between 
Phel40 and Hisl63 in the crystal structures of SARS- 
CoV M pro (wild type and variants). The broken lines 
indicate the alignment of the partial positive charges (8+) 
of the hydrogen atoms on the phenyl ring of Phel40 with 
the partial negative charges (8—) in the central part of the 
imidazole ring of Hisl63. 
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Figure 9. Protonation/deprotonation states of Hisl63 hydrogen bonded to Tyrl61 in SARS-CoV M pro (wild-type and 
variants). K al is the dissociation constant for the protonation of the Hisl63 N 61 atom; K a2 is the dissociation constant for 
the deprotonation of the Hisl63 N s2 atom. 


the covalent acyl-enzyme intermediate) may drive 
the induced-fit binding of a substrate to M pro ; after 
the deacylation step (i.e. breakdown of the covalent 
acyl-enzyme intermediate), the removal of the 
product may be favored by the recovery of the 
conformational flexibility of the active sites and the 
SI specificity pockets of M pro . 

As the side-chain of Pl-AGln of the APE occupies 
the SI specificity pockets of M£a(-i ), the side-chain 
amide N atom of Pl-AGln of the APE donates 
hydrogen bonds to the Phel40 carbonyl O atom and 
to the Glul66 O el and O 82 atoms of M£a(-i> in 
addition to the hydrogen bond between the side- 
chain carbonyl O atom of Pl-AGln of the APE and 
the His 163 N 82 atom of M£a(-i)- Similarly, additional 
hydrogen bonds are observed in the crystal struc¬ 
tures of the M£a(-i) : APE complex in space group 
P2 1 2 1 2 1 , the M pro :APE complex, 29 protomer A of the 
M pro :CMK complex, 25 and the complex of M pro with 
a peptidomimetic inhibitor. 26 In the crystal struc¬ 
tures of the complexes of an M pro variant with a 
series of peptidomimetic inhibitors, these hydrogen 
bonds seem to be water-mediated. Consequently, 
even without the participation of Seri of the 
opposite protomer of M£a(-i)/ Phel40 and Glul66 
are tied together by these hydrogen bonds to form 
parts of the floors of the SI specificity pockets. These 
hydrogen bonds also conformationally anchor 
Phel40, as indicated by its close-to-average B-factors 
(Figure 5(d)), and make the 7r-stacking of its phenyl 
ring with the imidazole ring of Hisl63 resistant to 
disruption. As a result, the catalytically competent 
conformation of the active sites and the SI specificity 
pockets of M£a(-i) is rigidified; the position of the 
equilibrium described by equation (1) would shift to 
the left. This probably serves as an additional 
explanation for the predominant SI specificity of 
M pro or M p i° ( _i) for Gin, but not for Glu. Comparison 
of the structure of the unbound M pro with that of the 
APE-bound M pro , both in space group C2 at pH 
6.5, 29 does not show an obvious reduction in the 13- 
factors of residues Lysl37 to Serl44 upon the 
binding of the APE to M pro , probably because 


Phel40 and Glul66 of the parent protomer, and 
Seri of the opposite protomer in the unbound M pro 
interact normally and immobilize these residues. 

In protomer B of the M pro :CMK complex, the 
binding of the CMK does not induce the recovery of 
the catalytically competent conformation of the 
active site and the SI specificity pocket of M pro 
(Table 2). The side-chain of PI-Gin of the CMK 
cannot be accommodated by the collapsed SI 
specificity pocket, so it protrudes into the solvent. 25 
This could be attributed to the covalent bond 
between the Cysl45 S 7 atom of M pro and the 
methylene C atom of the CMK, whose rotation can 
orient the side-chain of PI-Gin of the CMK towards 
the solvent, thereby avoiding the steric hindrance 
due to the collapsed SI specificity pocket. In 
protomer A of the M pro :CMK complex, although 
the active site and the SI specificity pocket of M pro 
are in the catalytically competent conformation 
(Table 2), allowing the side-chain of PI-Gin of the 
CMK to occupy the SI specificity pocket of M pro , the 
unoccupied space in the rest of the substrate-binding 
region of M pro is large enough for the conforma¬ 
tional rearrangements in the rest of the CMK, 
resulting in an unexpected binding mode of the 
CMK to protomer A of M pro as well. 25 These 
unexpected binding modes are not observed in the 
crystal structures of the complexes of M pro (both the 
wild-type and the variants) with the APE,~ nor with 
the peptidomimetic inhibitors, 26 ' 27 probably because 
their P' ester groups sterically hinder the rotation of 
the C-S covalent bond formed between the pepti¬ 
dases and the inhibitors. In the case of the APE, the 
7v-conjugation of the N a atom and the carbonyl 
group of Pl-AGln restricts the orientation of the 
side-chain of Pl-AGln to point towards the SI 
specificity pockets of M pro or M£a(-i)- In the case of 
the peptidomimetic inhibitors, the C atom attacked 
by the Cysl45 S 7 atom corresponds to the carbonyl 
C atom of the scissile peptide bond of a substrate. 
Therefore, the resulting C-S bond pulls the inhibitor 
towards the back wall of the substrate-binding 
region. This reinforces the steric hindrance due to 
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the V ester group of the inhibitor against the 
rotation of the C-S bond. 

Thus, a refined explanation could be provided for 
the pH-dependence of the activity of ]y[P rol2 ' 18 ' 25 ' 36 : 
At low pH, a significant number of the M pro 
protomers are in the catalytically incompetent 
conformation. However, these protomers are con- 
formationally flexible; the recovery of their catalyt¬ 
ically competent conformation can be induced by 
the binding of a substrate. The activity of M pro is still 
compromised, because of the energetic cost associ¬ 
ated with this induced-fit binding. It would be 
interesting to determine this energetic cost quanti¬ 
tatively, and compare it with the experimentally 
determined decrease in the activity of M pro . Before 
carrying out such studies, the possibility of a second 
mechanism, such as the change in the protonation / 
deprotonation state of the catalytic dyad with pH, 
being involved in the pH-dependence of the activity 
of M pro should not be ruled out. 

Domain III of SARS-CoV M pro 

The compositions of the dimer interfaces observed 
in the structures reported here are essentially 
identical with those observed in all of the previously 
reported structures. Extensive studies have been 
carried out on the role of domain III in the 
dimerization of M pro . With domain III truncated, 
the protomer of M pro does not dimerize over a wide 
range of concentrations, ' whereas domain III 
alone forms a stable dimer even at very low 
concentrations. 10 Truncation of the final C-terminal 
helix (residues 293-306) strongly disfavors the 
dimerization of MP ro ; 14 ' 43 analysis of the dimer 
interfaces observed in the structures of M pro found 
that most of the residues of domain III involved in 
dimer interactions are parts of the final C-terminal 
helix. Superpositions of the previously reported 
structures of M pro in space group P2 1 at pH 6.0, 7.6 
and 8.0, 25 show that domain III of both independent 


protomers rotates in a rigid-body manner as the pH 
increases from 6.0 to 8.0, whereas the remainder of 
the M pro dimer remains essentially in the same 
atomic positions (Figure 10). Domain III of protomer 
B rotates through a slightly greater angle than does 
domain III of protomer A. This rotation was 
regarded as a consequence of the recovery of the 
catalytically competent conformation of the active 
sites and the SI specificity pockets of M pro as the pH 
increased from 6.0 to 8.0. However, the structure of 
M pro in space group P2 1 at pH 6.0 reported here, 
having the active sites and the SI specificity pockets 
of both protomers in the catalytically competent 
conformation, shows good agreement in the atomic 
positions of domain III of both independent proto¬ 
mers with the previously reported structure of M pro 
in space group P2 1 at pH 6.0, but not with those at 
pH 7.6 or pH 8.0. The interactions between the 
domain-III residues of opposite protomers, and the 
interactions of the residues of domain III with those 
of domains I and II of the same protomer have been 
analyzed, but the mechanics of this rotation is still 
unclear. The structures of M pro in the other space 
groups have not been studied over such a wide 
range of pH. It would be interesting to explore 
whether this rotation is an artifact associated with 
crystal packing in space group P2 lr or whether 
similar rotations occur in the structures of M pro in 
the other space groups as well, indicating some 
biochemical significance of this rotation. For exam¬ 
ple, the rotation may represent a mechanism 
underlying the pH-dependence of the dimerization 
behaviors of M pro . 12 

Materials and Methods 

Preparation of SARS-CoV M pro , M p k° ( -i) and APE 

SARS-CoV M£a(-i) was cloned, over-expressed and 
purified as described. 24 A clone expressing M pro was 



Protomer A 


Protomer B 



Protomer A 


Protomer B 


Figure 10. C a traces showing the orientations of domain III of the previously reported structures of SARS-CoV M pro in 
space group P2 1 at pH 6.0 (red), 7.6 (yellow) and 8.0 (blue) 25 , and the structure of M pro in space group P2 1 at pH 6.0 
reported here (green). Domain III of both independent protomers rotates as pH increases from 6.0 to 8.0. 
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generated using oligonucleotide-directed evolution to 
delete the codon corresponding to the N-terminal Ala of 
M£a(-i)- Using this clone, M pro was over-expressed and 
purified essentially as described for MEa(-i)- Cbz-Leu-Phe- 
AGln-(S,S)EP-COOEt was synthesized using the methods 
established to synthesize other APEs 30,32,33 with minor 
modifications. 

Crystallization, crystal soaking and cryo-protection 

Before crystallization, both SARS-CoV M pro and MSa(-i) 
were dialyzed against 20 mM Tris-HCl (pH 7.5), 20 mM 
NaCl and concentrated to 10 mg/ml. All crystals were 
grown at ambient temperature by the hanging-drop, 
vapor-diffusion method. For the P2\ crystals, the reservoir 
solution contained 0.1 M Mes (pH 6.0), 2% (w/v) 
polyethylene glycol 20,000, 3% (v/v) dimethyl sulfoxide, 
and 1 mM dithiothreitol. The final crystallization drop 
contained equal amounts of the M pro solution and the 
reservoir solution. Crystals of a thick-plate habit grew in 
three to five days to a size of about 0.3 mm x 0.3 mmx 
0.1 mm. For the P4 3 2!2 crystals, the growth conditions 
were the same as those for the P2 1 2 1 2 1 crystals; 29 the 
reservoir solution contained 0.1 M Mes (pH 6.5), 50 mM 
ammonium acetate, 6% (w/v) polyethylene glycol 8000, 
3% (v/v) ethylene glycol, and 1 mM dithiothreitol. The 
drop contained equal amounts of the M£a(-i) solution 
and the reservoir solution. Bipyrimidal crystals grew in 
three to five days to a size of about 0.1 mmx0.1 mmx 
0.1 mm. Macroseeding was carried out to improve the 
quality of the crystals. Crystals of good quality were 
selected and soaked overnight in drops having the 
same compositions as their reservoir solutions plus the 
APE chosen for this study present at 3 mM. Cryo- 
protectants had essentially the same compositions as 
reservoir solutions, except for the inclusion of 25% (v/v) 
ethylene glycol and the exclusion of dimethyl sul¬ 
foxide and dithiothreitol. Crystals were soaked for 
about 10 s and then immediately shock-cooled in liquid 
nitrogen for storage and shipment to the synchrotron 
beamline. 

Data collection and processing, structure solution, 
refinement and analysis 

The X-ray diffraction data from all crystals were 
collected at the synchrotron Beamline 8.3.1 (equipped 
with an ADSC-Q210 CCD detector) at the Advanced Light 
Source in the Lawrence Berkeley National Laboratory. All 
data sets were indexed, scaled and merged using DENZO 
and SCALEPACK. 44 Structure solution and refinement 
were carried out in CCP4. 45,46 All structures were solved 

A 1-7 

by the molecular replacement method using MOLREP, 
with the structure of unbound SARS-CoV M pro in space 
group C2 (PDB accession code 2A5A) being used as the 
search model. 29 In the structure of the MJax-i/APE 
complex, APE was located as outstanding electron 
densities in the substrate-binding region of M£a(-i) in 
both the IP 0 I -1 P c I ,a c (contoured at 3 a and 4a) and the 
2 IP G I - I P c I ,a c (contoured at la) maps. A restraint was 
applied to the covalent bond between the Cysl45 S 7 atom 
of M£a(-i) and the epoxide C3 atom of the APE, on the 
basis that the length of a C-S single bond is normally about 
1.8 A. All structures were then iteratively refined using 
REFMAC, 48 and manually adjusted when needed using 
XtalView/Xfit. 49 The stereochemical qualities of the final 
structures were assessed using PROCHECK. 50 Graphical 
representations of the structures were prepared using 


PyMOLJ. Superimpositions of the various structures were 
carried out using ALIGN, 51,52 based on the main-chain 
atoms (amide N, C a , and carbonyl C and O). The surface 
areas of the structures were calculated using N ACCESS § 
Dimer interactions and M^a(-i) : APE interactions were ana¬ 
lyzed using DIMPLOT' 53 and LIGPLOT, 53 respectively. 

Protein Data Bank accession codes 

The atomic coordinates and the structure factors of all 
the structures have been deposited in the RCSB Protein 
Data Bank. The accession code is 2GT7 for the structure of 
the unbound SARS-CoV M pro , 2GT8 for the structure of 
the unbound M£a(-i)/ and 2GTB for the structure of the 
M5a(-i)-APE complex. 
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